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Abstract 

This paper studies the threshold estimation of a TAR model when the underlying 
threshold parameter is a random variable. It is shown that the Bayesian estimator is 
consistent and its limit distribution is expressed in terms of a limit likelihood ratio. 
Furthermore, convergence of moments of the estimators is also established. The limit 
distribution can be computed via explicit simulations from which testing and inference 
for the threshold parameter can be conducted. The obtained results are illustrated with 
numerical simulations. 
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1 Introduction 

Since the publication of the seminal treatise of Tong [18], the field of nonlinear time series 
has been receiving considerable attention in the literature. Today, nonlinear time series has 
been widely applied to subjects such as ecology, engineering, chaos, finance and econometrics. 
Prom a statistical perspective, nonlinear time series also furnishes an exciting platform for 
nonstandard statistical inference both parametrically and nonparametrically. For a compre- 
hensive survey on some of these recent developments, see Fan and Yao [9]. 

Among many different developments in nonlinear time series, estimation and testing of the 
threshold parameter constitute one of most challenging tasks. One of the main reasons of the 
difficulty arises from the fact that tricky and nonstandard asymptotic techniques are required 
to handle the threshold estimation, see Chan [3j, Hansen [11] and [12]. A comprehensive 
theory for this type of problems seems to be lacking from the literature so far, however. 

On the other hand, a relative complete theory for the statistical inference for diffusion 
processes in continuous time is available, see for example Kutoyants [14] and [15]. In par- 
ticular, these two books demonstrate that both the maximum likelihood and the Bayesian 
approaches to diffusion processes can be put under a general context and an asymptotic 
theory can be developed, albeit to its non standard nature. 

One of the main purposes of this paper is to make use of this general theory and apply 
it to the nonlinear time series context. Related contributions to continuous time ARM A and 
threshold ARMA models can be found, for example, in Brockwell [2], Chan and Tong [5], 
Stramer, Brockwell and Tweedie [17], and Tong [18] and the references therein. 
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Although likelihood inference for the threshold parameter of nonlinear time series was 
considered by Chan [3] and Hansen pT] previously, the asymptotic machineries employed 
were of special nature which cannot be easily generalized to other situations. For further 
background on likelihood tests of non-linearity, see Li and Li |16j . From a Bayesian per- 
spective, Geweke and Teuri [10] considered a Bayesian threshold AR model and derived the 
posterior distribution of the threshold parameter. However, a detailed description of the 
asymptotic properties of the Bayesian estimator and its moment convergence were lacking. 

By incorporating the developments in diffusion, this paper illustrates a general method- 
ology to tackle both the maximum likelihood and Bayesian estimation problems from which 
simulations can be efficiently conducted. Moreover, the proposed approach is sufficiently 
transparent and can be easily adopted to other nonlinear time series context. 

A second but equally important goal of this study is to develop an implementable scheme 
for simulating and computing the limit likelihood statistics. By linking the integral equation 
of the underlying invariant density of the nonlinear time series and the intensity of the limiting 
compound Commission process, one can compute the form of the limiting likelihood explicitly. 
To the best of our knowledge, this has never been conducted before and results obtained in 
this paper can greatly enhance the inference for the threshold parameter of a nonlinear time 
series and extend its applications. 

This paper is organized as follows. Background introduction together with the statement 
of the problem and the main result are given in Section 2. Section 3 consists of simula- 
tions. Section 4 discusses the extension to cover the usual one-sided threshold setting while 
conclusions and possible extensions are given in Section 5. 

2 Main result 

Consider the model 



We choose the current setting because it is more general and mathematically more convenient. 
Our results can be easily extended to encompass the standard setting as demonstrated in 
Section 4.1. We suppose that a 2 > 0, p\,Pi are known and # € = («,/?) is the unknown 
threshold parameter. Our goal is to estimate i? from observations X n = (Xq, Xi, . . . , X n ) 
and to describe the asymptotic behavior of the estimators as n — > oo. Recall that {Xj)- >1 is 
geometrically mixing (see Chen and Tsay [6]) and denote its stationary density function by 
/ x), see also Fan and Yao [9]. 

In this paper, we consider both the maximum likelihood and Bayesian approaches. Recall 
that the likelihood function is written as 




L 
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and the maximum likelihood estimator (MLE) i? n is defined by the equation 



sup L (■&, X n ) = max L I i!) n +,X n ) ,L [ i? n — , X n I . (2) 

If this equation has many solutions, we can, for example, call the MLE to be the value which 
is at the center of the gravity. Note that the function L X n ) , •& G has jumps at the 
points 

tf^LXjlee, Z = l,...,L, 

where L < n. Clearly, if = R, then L = n. 

To apply the Bayesian approach, suppose that the unknown parameter is a random vari- 
able with a known prior density p(0) ,9 G 0, which is continuous and positive. Using the 
quadratic loss function, the Bayesian estimator (which minimizes the mean squares error) is 
the conditional mathematical expectation 

L / a %(9)L(#,X»)d9 

Properties of the least squares estimator (LSE) of •& were studied in Chan [3]. The LSE 
coincides with the MLE for Gaussian £j . We therefore recall properties of MLE and compare 
them with properties of the Bayesian estimators. 

First, introduce the stochastic process 



Z{u) 




2 ,,2 



W-^ES (M ] et}, u>0, 



where N + (•) and iV_ (•) are two independent Poisson processes of intensities A+ = A_ = 
2/ (/($, x) is the stationary density function of Xj) and £^,£ ; ~ are independent Gaus- 

sian Af (0, cr 2 ) random variables. It is easy to see that 

N+(u) N+(u) 

Y + (u) = p 2 tf 2 N + {u) + 2p$ ef = [p 2 tf 2 + 2p$e+] , u > 

z=o z=o 

iV_ («) iV_ (m) 

Y_ (u) = p 2 $ 2 iV_ (u) + 2p$ e T = Yl [P 2 ^ + 2 P& e T] ' n ^ 

are compound Poisson processes. 

The random process Z (•) is piecewise constant and as a result, the points u* of the 
maximum of the process Z (•) is defined by 

sup Z (u) = Z (u*) , 

where 

u m < u* < um- 
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Here u m and % are two consecutive events of the process N + (•), or of the process N_ (•), 
or they are respectively the first event of N + (•) and N_ (•). Simulated realizations of Z (•) 
are given in Section 3. The center of gravity of the interval is given by the point 

u m + u M 

u = 2 • W 

Such a choice of u is explained in Section 4 below. It follows from the result of Chan [3] that 
the MLE i? n is consistent and 

n {&n — "&\ u. 

Introduce the random variable 

J uZ (u) du 

u = . 

J Z (u) du 

The main result is the following theorem. 

Theorem 2.1. The Bayesian estimator fin constructed by the observations X n of the thresh- 
old autoregressive process is consistent, the normalized difference n (d n — i^j converges in 
distribution : 

n (ti n - d) u (5) 



and for any p > 



lim E# 

?1— >QO 



n (d n - i?) [ = E* \u\ p . (6) 



Proof. The proof of this theorem is based on the general result by Ibragimov and Khasminskii 
|13j . Theorem 1.10.2. To apply it we study the normalized likelihood ratio process 

L ($ + ^,X n ) 

Zn («) = L{ ^ xn) > ueU n = [n(a-#),n(P- ■&)] , 

where i? is the true value. Recall the main steps. Write the Bayesian estimator (0 U = $ + ^) 
as 

~ _ jy P (e)L(9,X n )d9 | i f Vn up(8 u )L(e u ,X n )du 
n f a p (6) L (0, X n ) d9 n J Vn p (9 U ) L (9 U , X n ) du 

_ ^ 1 Jvn U P ^ L wx») dM _ tt 1 /u„ up (fl u ) Z n (n) du 

n k n p (««) ffeSy d ^ n /u B P (*«) ^ («) du ' 

Suppose that we proved the convergence of the process Z n (•) to the process Z (•) providing 
the convergence of these integrals. Then 

/ ~ \ fuZ(u) du _ 
nfo-tfj=> Jz[u)du =u. 

This convergence together with an estimate on the large deviations of the tails of the process 
Z n (•) allow us to prove the convergence of the moments (J6]). 

Now check the conditions of the Theorem 1.10.2 in |13j . We need to prove 
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1. the convergence of the finite dimensional distributions of Z n (•) to the finite dimensional 
distributions of Z(-), that is, 



Z n (•)"►£(.) f.d.d. 



2. to establish the estimate: 



E, 



ZV2 (u2 )_ Z l/2 (ni) 



< C \ll2 — U\\ 



3. and to establish the estimate: for any M > 



(7) 



(8) 



(9) 



The convergence of finite-dimensional distributions follows from the Proposition 2 of [3]. 
Instead of repeating a technical argument as in [3], we offer a different intuitive (but rigorous) 
explanation as follows. Rewrite the process ([1]) as 

Xj+i = PiXj + pXj l{\ X j\>ti} + J = 0, ...,n-l, (10) 

where we use I{|x,|<i9} = 2 — I{|x,|>#} an d denote p = pi — p\. 

Put Z n (u) = exp { — -^jY n (u)} and study the process Y n (u) for positive values of u. 

n-l 

3=0 



Xj+i - pi Xj - pXj Iji^i^+i} 
- [Xj+i - pi Xj - pXj 1{| Xj |>^} 



n-l 
3=0 



x [2X j+1 -2piXj - pXj 



It 



{\X 3 \>^} + ^{\ Xj \>^} 



Note that 



Hence, 



n-l 



y n (u) = JpXj [2X i+1 - 2 Pl Xj - pXj] 1 



3=0 
n-l 



{*<l*il<*+s} 



= E (p 2 X J + 2 ^ X i 1 {a<\x 3 \<#+±}- 
3=0 

Next introduce another process 

n-l 

(«) = £ (P 2 ^ 2 + 2p^sgn (X,) ej - +1 ) l { ^ <|Xj|< ^ +f } 
3=0 



(11) 



(12) 
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and put I| ]?< | X | <1?+!£ | = l{Bj (u)} • We show that this process is asymptotically equivalent 
to the process Y n (u). We have 

n-l 

Etf \Y n (u) - Y: (u)\ <p 2 Y,V# \X] - tf 2 | I {B . (U)} 

j=0 
n-l 

+ 2pJ2Vo\X j -#sgn(X j )\ \e j+1 \ I {B , (u)} . 

j=0 

For the first term we write 

Etf \X] - tf 2 | 1 {M {U)} = I |x 2 - tf 2 | /(0,x) dx 

•A?<M<*H-^ 

<2^+-U-) 2 max f{0,x)<C (-Y . 
V n/ Vn/ 0<| x |<0+^ Vn/ 

The second term is (recall that Xj and Ej + \ are independent) 

E tf |Xj - i?sgn(X,-)| |e j+ i| 1{b 3 («)} 

= - sgnpfj)| l {Bj(u)} E^ 




a E,, |Xj - d sgn (X,-)| 2{b,(«)} 



= 2*/-(t / |x- t?| dx < C (-) 

V vr J^< x <^+!l Vn/ 

— — r>. 



Hence, 



u 2 

^\Y n (u)-Y° n (u)\<C -^Q 



for any fixed u. Therefore, it is sufficient to study the limit distribution of the random 
function Y° (u) and to show the convergence 

N+(u) 

Y° (u) Y + (u) = Y, [P 2 ^ 2 + 2p ^ e+] • (13) 
z=o 

To see that the limit of Y° (u) is a compound Poisson process, first note that the charac- 
teristic function 



$ (v) = E^e^+W = E^^V^P^ 



E^E^ 

E e ["V 2 i? 2 -2t) 2 p 2 i? 2 a 2 ]AT+(«) 

expju ( e »P 2 ^-^ 2 P 2 ^^ _ ^ 2/ (0,0)} , (14) 
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where we denote J~n + to be the cr-algebra related to the Poisson process and make use of the 
independence of and N + (•). The desired convergence will be proved if the convergence of 
the characteristic function of the process Y° (•) to (|14|) is established. 

Fix u > 0, then as n — > oo the band \$, i? + ^1 becomes narrower and the events, when 
\Xj t | € $ + ^] , become more rare. This means that the distance between two consecutive 
events G ^] and |X, !+1 | G tends to infinity. As the process (-Xj) ?>1 is 

geometrically mixing, these events become asymptotically independent. Under such circum- 
stances, the characteristic function <£ n (v) = 'E$e wY ™( u ^ can be calculated explicitly as 



$ n (u) = E# E# exp < 



n-l 



(p 2 i? 2 + 2ptfsgn(X i ) e j+1 ) I {Bj(u)} 



i=o 



-Fx 



n— 1 



Further, 



= E* exp ^ ( iv P 2 ^ ~ 2v2 P 2 $ 2 ° 2 ) %»} 
= (E^exp{(iV^-2^V^ 2 ^ 2 ) I { B l( n)}}) n - 

E* exp { (ivp 2 ^ 2 - 2v 2 p 2 °d 2 a 2 ) l {Bl («)}} 



+ 



+ 



/ (i?, x) dx 



+ 



+ 



D ivp 2 -d 2 -2v 2 p 2 -d 2 a 2 



f x) dx 



n n \nJ 



Hence, 



In (v) = n In ( H — | e 

n 



n ' wpH 2 -2v 2 p 2 & 2 a 2 



12/ (0,0) + o 



n 



u e 



,ivp 2 tf 2 -2v 2 p 2 i? 2 (T 2 



1) 2 f (■&,#) = In *(«) . 



□ 



That is, it coincides with p^j) and as a result, (J7|) is proved. 

To establish conditions ([8]) and ([9]) we need the following two lemmas. 

Lemma 2.1. There exists a constant C > such that for all values ui,U2 £ (n (a — t?) , n (/3 — $)) 
we Zioue i/ie inequality 



eJzV 2 (u 2 )-zI/ 2 ( Ui )) <c 



\U2 - Ui\ 



Proof. The first step is 



E* (z x J 2 (u 2 ) - Z]j 2 ( Ul j) = 2-2E#[Z n (u 2 ) Z n (m)] 1 



/2 



2-2E,« 



Z n (Ug) 

_Z n (m) 



1/2 
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where the measure is changed from P$ to ui . We have (u2 > u\ > 0) 

n 



E 



Z n (^g) 

E# x exp 



1I/2 



n-l 



"4cj 2 



j=0 



Note that 



1j 



)(«2)} 



?Ol)} 



Further as 1 — e x < x, 



E, 



ZV 2 {u 2 )-Z l J 2 ( Ul )\ 2 
{ 1 

= 2 - 2E tfl exp I Y, [P 2 X J + 2 P X 3 2 1 



^ { E [P 2 X l + 2 P X i £ i+i] 

k i=o 

,,2 



n-l 



—^ 1 X j l {Cj} = ^ 



np 



2rr 2 n 
< C \u 2 — ui \ . 

We see that © is fulfilled. 



1i2\ 2 U2 — U\ 



n 



x<2 f H — — , ic ) dx 



/(*4A* + )+/(* + ->*- 



(15) 
□ 



Lemma 2.2. For am/ p > i/iere exists a constant C = C (p) > suc/z i/iai for all values 
ti£ (re (a — i?) , n (/? — $)) roe Ziawe the inequality 



\uy 

Proof. We have to study the following expectation 

EtfZV 2 (u) ^expf-^ [ p 2 X 2 + 2pXj £ . +i \ J 

Start with the probability 

PtfjlnZy 2 (u) > -c|u|} =P* j--^y n (u) > -c|u 



j n— 1 
j=0 



8 



Write 



P.; { ~^2 Y n(u) > ~\U\ > - P, ■ 



n-l 



)} 



J=0 



j n—l ^ n— 1 

"^2 2 P 2 X j %»} - 1^2 £ e i+i ^(u)} > " 



i=o 



4a 2 



i=o 



n-l 



< Ptf 



l^P X 3 E {B J («)} >— S-M 



J"=0 
n-l 



n-l 



3=0 3=0 

For the last probability, by Markov inequality we have 

^ n— 1 ^ n—l 

P,9 I "^-2 P 2 ^-(u)} ~T^J2p X j e 3+ 1 I {B, («)} > 



32ct 2 



i=o 



cm 



3=0 



because 



[ 1 n_1 1 n_1 I 

exp I p 2 X 2 l {Bi(u )} - 7-2 ^ P^i I {B J («)} > = 1 



3=0 3=0 

The last equality follows from the following property of the conditional expectation 



E,? exp 



■^a^-il{Vi(u)} ~ 4^2 e ™ I {B„-i(«)} 



^n-l 



exp 



P X n-1 %„-!(«)} 

32a 2 



Etf exp 



pX n _iI {Bra _ l(M)} 
4a 2 ' 



•F-i n- 



1. 



Hence, it is sufficient to study the probability 

n—l 



Ptf < 



^X|1 {B . (U)} > -Cl |u| 



> - 



where ci = 16ca 2 p 2 . 

Fix some k€ (0,1) and consider first the local values u satisfying the condition \u\ < n K . 
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Suppose that u > (for u < the consideration is similar). Then we have 
n— 1 n— 1 

E ^ - *J] %»} - ^ 2 E * - c i 

, i=o i=o 

I n— 1 n— 1 

p* E W - ^ 2 ] + ^ 2 E ^ c i 



P ,9 < 



j=0 
n-l 



<P^<J ^I {Bj(u)} <c 2 ^ 
j=0 



because 



X? - 2 



i=o 



- 2L 

02 ~ tf 2 ' 



> 0. Recall that the sum E^™ =0 I{B(u)} converges to the Poisson 
process of intensity A = 2/ hence the last probability has to be small for the values 

c 2 < A. Let Yj = !{Bj(u)} - E^l{ Bj ( u )} and note that 

E^I {B (u)} = / f(#,x)dx = - \f U, + f U, tf 2 ) 

J J#<\x\<#+% n I \ J V ) 

= 2- f(0,#)(l + o(l))>-f (&,■&) , 
n n 

where the last inequality holds for n > n\. Recall that the function / x) is even and 
/(■&,-■&) = /(#,#). Further, 



n-l 



p <? { E %»> - ° 2 u \ - Pt 

3=0 



n-l 

3=0 
n-l 

E^ 



> (/ (0, u 



E, 



> c 3 -u > < 



2p 



,2p.. 2 p 



i=o 

where we chose such c that c% = f i9) — 16ccr 2 /9~ 2 ??~ 2 > 0. To estimate the last expectation 
we apply the inequality of Dedeker and Doukhan (see (8.1) in [7]): 





n-l 


2p i 




E^i 






3=0 





n-l 



3=0 



(16) 



Write 



Etf (Yj\ Jb) = E^ (E^ {Yj\ T 
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and let g (x) = p x xl^^y + p 2 xl^^y, then 

Etf (Yj\ Tj-x) = E# Cl{ Bj ( u )} Tj-ij - E^I{ Bj .( u )} 

ip (x) dx + 



E <> { 1 {#<\9(X j _ l )+e j \<#+%} 



E i?l{Bj(ti)} 



#<lffPO-i)+z|<#+- 



#<M<1?+- 



/ ($, x) dx 



V 

n 
u 

+ - 



<p[ti-g(X j - 1 ) + -)+<p[# + g(X j - 1 ) 



u 



n 



11 



where ip (•) is the density function of the Gaussian r.v. Ej, i.e., cp(-) ~ (0, <7 2 )) and u,u,u 
are some values between and u. Note that the function A (x) defined by the last equality 
is bounded and E&A (X,_i) = 0. 



Hence 



u 



E*(Y-|7b) = - E tf (A(X,-_i)|7b 



n 



and 



n-l 



n-l 



^ [E* \Y E„ (Y,| Jb)^* = I E ^ l y o E * ( A ^H* 



i=o 



n-l 



<(7^«0--l) = C^ 7 ^<C^ 
n n z — ' n 

j=0 j=0 



3=0 
n-l 



where we used the geometrical ergodicity of (Xj).^: a (J) < 7 J , < 7 < 1 (see [6]) and the 
inequality of Ibragimov 

WE#(A(X j )\F )--E#A(X j )\\ p 

<C\\E# (A (Xj)\ Jo) - B#A (X,) \\l /p <Ca {j) 1/p 

(see Bradley p], Theorem 4.4, (a2)). 

Finally, we obtain (for \u\ < n K ) the estimate 



P,9 < 



n-l 



c 

£ !{%(«)}< 02 M ^ < |^jp. 

J'=0 



Consider now the case \u\ > n K . Of course, \u\ < (/3 — a) n. We have 



n-l 



j=0 



< P, 



n-l 



j=0 



> ci |u| 



(17) 
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because 



i?<b|<i9+-L^ 

1 1 n 



x 2 f 0&, x) dx > a 2 — inf / x) > — Id 

n a<x</3 n 



and the constant c is chosen such that C4 > 2c\ . The last probability in (I17p can be estimated 
with the help of the following lemma. 

Lemma 2.3. (Rosenthal's moment inequality) Let {Zj)- >1 be zero mean mixing series sat- 
isfying the condition: there exist e > and c £ 2N, c > 2p > 2, such that 



^ (r + If 2 [a (r)]<=+ e < 00, 



(18) 



r=l 



where a (r) is the a-mixing coefficient, then 

2p 



E 



3=1 



< c 



n 



E \Z 1 \ 2p+E ) 2p2 " £ + n v (EZ 2+£ )^ 



(19) 



For the proof see [8j, p. 26. 

As the process Xj is geometrically mixing (see [9], Theorem 2.4), hence condition (|18p is 
fulfilled with any c > and e > 0. We apply (fl~9|) with 



Z,- = X? I 



E,?X? I; 



i ^{Bj(u)} - R {B, J (u)}- 

Obviously, e n (it) = E^X? I{b^(«)} — /^ 2 an d e n (u) < C We suppose for simplicity that 

■u > 0, 



/)9<|a;|<i?+S 



+ e n (u) 2p+£ fl- jf 



- e n (it) 1 2p+e / (1?, x) dx 



■&<\x\<&+ 1 



f (1?, x) dx 



< Ci - + C 2 

n 



2p+e 



and similarly 
Hence, 



E,j |Zi| 2+£ < C3 — h C4 



2+E 



E, 



n-1 



3=0 

2p 



2p 



< C \n 



2p 

12/, + E 



?? 



2p 

2 + e 



2p 



C I n 2 p+= Id' 1 2 p+ £ + n 1 2p |n| 2p + n 2 '+<= |d P 2 += + 



V 



< C ( Id «< 2 p+=) +1 2 p+= + \u 



l-2p 



+2p 



+ d K(2+£) 



+ ld P <C Id 
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where we have used the relations n < \u\ < (/3 — a) n and have chosen sufficiently 

small e (or sufficiently large p). 
By Chebyshev inequality 



n-l 



j=0 



C 

> Cl \U\ } < -7—Tp. 

\u\ 



The estimates obtained above allow us to write the following expression: for any p > 1 
and all u 6 (n (a — $) , n (/3 — #)), there exist constants c > and C > such that 



u) > e 



J m 



(20) 



For the expectation, note that 



E ^n /2 («) = E*^ («) 2 {z y 2(n) > e - f N} + E,ZV 2 («) I {4 / 2(u)<e -f H} 



< 



(v#Z n {u)¥$[z n (u) >e- c l"l}) 



1/2 ci | C 



P/2" 



Recall that this estimate is valid for any p > 1, hence ([9]) is verified. Therefore the required 
conditions are fulfilled and the Bayesian estimate satisfied all of the properties stipulated in 
Theorem 1 (see Theorem 1.10.2, p3]). □ 



3 Simulations 



We obtain the density functions of limit distributions of the MLE and Bayesian estimators 
by the following simulations. The limit likelihood ratio is 



Z (u) = exp 



1=0 



2a 2 



for u > and 



Z (u) = exp 



2 q2 a N -(- u ) 

1=0 



2a 2 



(7- 



for u < 0. Here N + (•) and iV_ (•) are independent Poisson processes of intensity A = 2/ (i9, $) 
and the Gaussian random variables e^~, ej ~ A/" (0, cr 2 ) are independent, = 0. 
Denote 



7 



a 



a 



u = -, z/± («) = iV± 



Then the Poisson processes v+(v) ,v > and (v) , > have intensity 1 and the limit 
likelihood ratio 



Z 7 («) 



exp 1 7 £ 
exp { 7 £ 



!/_(-«) _ 



(-*)} 



if u < 0, 
if v > 0. 
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Now the limit process Z 7 (v) only depends on one parameter (7) and the limit random 
variables u and u can be written as 

- _ J -oc v z i (v) 
U ~ A' U ~ A ' ^" rj 7 (,)d, ' 

in obvious notation. 

The next problem is to find the function / x), where / (i?, x) is the stationary density 
function of Xj. As 

Xj+x = pi Xj + pXj I{|jtjl>i?} + 
where Xj and Sj+i are independent, we obtain the convolution equation 

f (0, y) = -j== J g (<?, x) e -i/^(y-«) dx . 

Herein, we denote the density function of p\ Xj + pXj |>tf} by g (1?, x). This density can 
be expressed as a function of / (•), which is a solution to a corresponding integral equation, 
see also Chan and Tong [3]. Specifically, observe that 

g(0, X ) = -fU-) l, w x +±fU±) W. V 
Pi V Pi J Xn^S 92 V 02/ in^S 

Hence, the integral equation is 



-f U - ) 11 

pi V pi 



+—/(*, — ) lr| t k,,i <^(y-x)dx 

P2 V P2/ i«^;J 

00 

/ [y (y - xpi) $-{\x\<#} + <p{y- xp 2 ) %{\x\>&}] dx - ( 21 ) 

-00 

Solution to this equation at the point $ is the intensity A = 2/ (1?, 1?) of the corresponding 
Poisson processes. Therefore the value / (1?, $) satisfies the integral equation 

/oo 
[p(#-a;pi) l { \ x \ < # } + (p($-xp 2 ) l { \ x \>#y]dx, 
-00 

where 99 (•) is Gaussian Af (0, a 2 ) density. We see that / (1?, 1?) > 0. To visualize the properties 
of the sample path Z(u) and the invariant density f(9, ■#), we conduct a simulation experiment 
by taking ef to be i.i.d. standard normal random variables. The parameters used are 
# = 2, pi = 0.15, P2 = 0.95, a = 1 and A = 0.5. A Gaussian kernel is used to estimate the 
form of f(8,2) based on 50,000 observations of Xt generated from model (|10p with e% being 
i.i.d. Af (0, 1) random variables. The plots of Z(u) and f(6,2) are given in Figures 1 and 2 
respectively. 

Figure 1. 
Figure 2. 
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For the maximum likelihood estimate, note that the maximum values of Z (u) form an 
interval [u m , um] with length \um — u m \ = rj, where r/ is an exponential random variable with 
probability density 2f^e~ 2 ^ x ,x > 0, f# = /(#,#)■ We can take any value of u from this 
interval, the middle point u = " M +" m ; say. To have its density function we only need to 
simulate the exponential and the Gaussian independent random variables which will generate 
ui, . . . un- The historgram of it based on 20, 000 simulated values of u is plotted in Figure 3. 
As can be seen clearly, the MLE performs reasonably well and converges to zero very fast. 
The sample mean of the simulated u is 0.01 with a standard deviation 0.033. 

For the Bayesian estimators we first calculate the integral 

f°° f Ul+1 -V-» 2 r 

J+= uZ(u) dn = 2^ / u e ^=° £r du 

Jo l=0 J Ul 

1=0 

Here e r ~ M (0, a 2 ) and ui = Yll=o Vr- By a similar way we have 



[•CO 

Jo 



10 

The limit random variable is 



Z(u)dn=2^e ~^ £r (u t+1 - ui) . 



1=0 



. J- + 
u 



I-+I+ ' 

with obvious notation. To understand the behavior of the Bayesian estimator, we simulate 
the Bayesian estimator for 20, 000 times with the histogram of u given in Figure 4. From this 
figure, it is clearly seen that the Bayesian estimator converges to the expected value zero. 
The sample mean is —0.0026 with a standard deviation 0.028. It is interesting to see that 
this simulation results are consistent with the theory that the limit variances of the MLE and 
BE 

1 N 1 N 

d 2 MLE ^J^/Z^l and d BE ~ Yl 

q=l q= 



satisfy 



d%[LE > d%E- 



Note that it follows from the symmetry of the limit process, the random variables u and u 
satisfy = = E^tt. 

For 20,000 simulated estimators, we obtain the limit variances as d 2 MLE = 22.83 ± 0.68 
and d 2 BE = 16.79 ± 0.39. These values concur with the theoretical results that the Bayesian 
estimator outperforms the MLE. 

Figure 3. 
Figure 4- 

To examine the finite sample performance of the test statistics, we computed the critical 
values of the limit distributions based on the MLE and the BE using the same set of param- 
eters as given in Figures 3. The sizes are chosen for commonly used test statistics and the 
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0.025 


0.05 


0.075 


0.1 


0.90 


0.925 


0.95 


0.975 


MLE 


-9.66 


-6.64 


-5.28 


-4.46 


4.46 


5.38 


6.87 


9.84 


BE 


-8.44 


-6.29 


-5.07 


-4.27 


4.21 


5.09 


6.26 


8.43 


Simulated Values 


-9.70 


-6.88 


-5.48 


-4.64 


4.90 


5.85 


7.55 


10.28 



Table 1: Critical values for A = 0.5, 9 = 2, a = 1, p\ = 0.95 and p2 = 0.15. 



limiting values are given in the first two rows of Table 1. As can be seen, both the MLE 
and BE procedures perform reasonably well and are in close agreement. Furthermore, the 
numbers in the last row of Table 1 are the critical values computed from the test statistics 
in ([5]), which are directly simulated from model using the same set of parameters. It is 
seen that the critical values generated from the simulated statistics agree remarkably well 
with the critical values computed from the limit distributions in Table 1 based on MLE. In 
summary, Table 1 demonstrates the usefulness of the limit distributions in computing the 
critical values. If one needs to conduct a test for another set of parameters, then a similar 
table can be computed and the programming code is available from the authors upon request. 



4 One-sided threshold 

In the nonlinear time series literature, the threshold AR model usually takes the form (see, 
e.g., [3], [HI and [E]) 

^j+i = Pl^j^-{Xj<&} + P2Xjl{ Xj >$} e j+i (22) 

Note that the study of these one-sided threshold models is no more complicated than ([I]) 
because the log-likelihood ratio 

depends on the stochastic process (for fixed u > 0) 

n-l 
3=0 

(see pip ), which is approximated by the process 

n-l 

y: (u) = {p 2 $ 2 + W^+i) K { o <Xj< o + *y 

3=0 

Here we use the same notations as before and add the condition that \p\\ < 1. Comparison 
with ()12p shows that the factor sgn (Xj) no longer exists and this simplifies matters much in 
the application of the limit theorems. Specifically, similar to (|13|) . the corresponding limit 
for Y° (u) becomes 

N+(u) 

Y: (u) ==> Y + (u) = £ lp 2 V 2 + 2p#s+] , 

1=0 
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the only difference is: instead of 2/ 6), the intensity of the Poisson process N + (u) is 
A+ = /(0,0). 

The inequalities for the process Z n (u) obtained in Lemmas 12.11 and 12 , 21 can be obtained for 
Z n (u) of the process ([22]) exactly the same way as in this paper. Consequently, the asymptotic 
behavior of the Bayesian estimator n for model (122ft is the same as that described in the 
Theorem 12.11 with the slightly difference due to the form of the limit likelihood ratio Z (u), 
where the intensity of the Poisson process is now / not 2/ (0,0). 

5 Discussion 

Let us explain heuristically why the choice (j4]) for the MLE is better than other types of 
the form u 7 = ju m + (1 — j)um with 7 7^ 1/2,7 S [0, 1]. The interval [u m ,UM] can be on 
the positive, negative parts of R or it can be [it]~,«i~], where and uf are the first event 
of the Poisson processes ZV_ (■) and N + (•) respectively. If [u m ,«jw] = then the 

random variables — = ("1 and = £2 are independent exponential with the parameter 
A = 2/(0,0). 

(«r is the Z-th event of the Poisson 
process iV_ (•)) has random length Z and 

« 7 = + (1 - 7) = 7 (uj - C) + (1 - 7) ^ 7 = ^ 7 - 7C, 

where £ is exponential random variable with parameter A. For the positive part 
n 7 = 7 u+ + (1 - 7) «J. j = iuf + (1 - 7) + C) = uf + (1 - 7) C- 

Denote to be the probability that the maximum of the random process Z (•) is on the inter- 
val [ifj~,ii^]. The positive and negative intervals are equiprobable, hence their probabilities 
P-,p+ satisfy the relations p^ = p + = (1 — po) /2 = p. We then write 

B^u 2 = p_ E„ (uT - 7 C) 2 + po E,? (7C1 - (1 - 7) C2) 2 

+ p+E# (ut + (1 - 7) C) 2 = 2pE tf (n+) 2 + 2pB$ E„C 
+ ^(27 2 -27 + l) + ^(47 2 -47 + l) 
and direct calculations show 

min E$ (u 7 ) 2 = E$ (u) 2 . 
76 [0,1] 

Note that in this problem of parameter estimation, it is possible to introduce the notion 
of asymptotic efficiency of estimators. The lower bound on the risk of all estimators n for 
the quadratic loss function is as follows: 

lim lim sup n 2 E,j (0 n - 0) 2 > E# () (u) 2 . (23) 

<5->0 n->oo |i9-i9 |<5 

This bound follows from the results of the Section 1.9 in [13]. We just note that the 
second moment E$ (u) 2 is a continuous function of 0. 



If this interval is on the negative part, then 



U i + V U i 
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As E$ (u) 2 is the limit of the Bayesian estimator, we can think of these estimators having 
smaller limit error than the MLE (as in singular estimation problems). To prove this asymp- 
totic efficiency of the Bayesian estimators, we need to show that the convergence of the second 
moments is uniform in $ on compacts. The corresponding uniform estimates on the process 
Z n (•) can be easily verified and what remains to be done is to establish the uniform version 
of the convergence of finite dimensional distributions, which can also be verified. 

Another possible generalization is to consider the case where {£j}j>i are independent 
random variables with a known density function satisfying some regularity conditions. Then 
the estimator i} n (defined by ([2])) becomes the least squares estimator and (defined by ([3])) 
is no longer Bayesian, but becomes another estimator having desirable asymptotic properties. 
The behaviour of these estimators can be similarly studied and their limit distributions can 
be defined via the corresponding limit process Z (•) when ef are no longer Gaussian. 

Note that a continuous-time analogue of TAR model is prescribed by the following stochas- 
tic differential equation 

dX t = - Ql X t l {lXtl<l3} dt - q 2 X t l {lXtl ># } dt + adW t , < t < T, 

where Q\ ^ Q2 > 0. This model can be called Threshold Ornstein-Uhlenbeck (TOU) process 
and it can be considered as a continuous-time approximation of the discrete time model ([T]). 
The properties of the MLE and BE of the threshold 1? can be studied with the help of the 
technique developed in [T5] . 
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Figure 1: The kernel density function of f(9,2). 
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Figure 2: A sample path of Z[u). 
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